[Relay][QNN] Support for non scalar zero points in qnn.conv2d #8620

jwfromm · 2021-08-02T18:30:51Z

This PR adds support for non-scalar zero point values in qnn conv2d operators and also allows the kernel zero points to be channel-wise. This is a needed change to support ONNX's ConvInteger nodes which typically treat zero points as expressions and are often generated using OnnxRuntimes quantization feature, that produces channel wise zero points. Although the rest of the qnn framework doesn't yet support non constant zero points, this is a good start that improves our onnx coverage considerably.

I also found that although qnn supported lowering uint8 convolution and dense to cuda, the dp4a instruction actually only supports int8 datatypes, an error exposed by the onnx frontend tests. I added some legalization logic to convert uint8 to int8 when the target is cuda.

jwfromm · 2021-08-02T18:31:12Z

@anijain2305 @mbrookhart what do you guys think of this change?

mbrookhart

LGTM

…#8620) * conv2d working, fixing conv2d_depthwise * Depthwise conv2d working. * Make convinteger work on cuda. * Simplify code and add tests. * Formatting. * Fixed fallback broadcasting. * Fix fallback broadcasting. * Formatting. * Fix lint * Merge with new test parameterization.

Josh Fromm and others added 8 commits July 28, 2021 22:25

conv2d working, fixing conv2d_depthwise

74c7c26

Depthwise conv2d working.

1b9812a

Make convinteger work on cuda.

1ffa74c

Simplify code and add tests.

9ea730e

Formatting.

cee02bc

Fixed fallback broadcasting.

1a8ea0f

Fix fallback broadcasting.

5a89d71

Formatting.

a4d1b62

jwfromm requested review from anijain2305, areusch, comaniac, jroesch, junrushao, merrymercy, tqchen, yzhliu, zhiics and ZihengJiang as code owners August 2, 2021 18:30

Fix lint

ada06e0

mbrookhart approved these changes Aug 2, 2021

View reviewed changes

Josh Fromm added 2 commits August 6, 2021 15:35

Merge into main.

8277bdb

Merge with new test parameterization.

8f7e048

masahi merged commit 11238b5 into apache:main Aug 7, 2021

junrushao mentioned this pull request Nov 1, 2021

Apache TVM v0.8 Release Note Candidate #9416

Closed

jwfromm deleted the qnn_checkpoint branch April 12, 2023 15:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Relay][QNN] Support for non scalar zero points in qnn.conv2d #8620

[Relay][QNN] Support for non scalar zero points in qnn.conv2d #8620

jwfromm commented Aug 2, 2021

jwfromm commented Aug 2, 2021

mbrookhart left a comment

[Relay][QNN] Support for non scalar zero points in qnn.conv2d #8620

[Relay][QNN] Support for non scalar zero points in qnn.conv2d #8620

Conversation

jwfromm commented Aug 2, 2021

jwfromm commented Aug 2, 2021

mbrookhart left a comment

Choose a reason for hiding this comment